UCD : Upper Con dence bound for rooted Directed acyclic graphs
نویسندگان
چکیده
In this paper we present a framework for testing various algorithms that deal with transpositions in Monte-Carlo Tree Search (MCTS). When using transpositions in MCTS, a Direct Acyclic Graph (DAG) is progressively developed instead of a tree. There are multiple ways to handle the exploration exploitation dilemma when dealing with transpositions. We propose parameterized ways to compute the mean of the child, the playouts of the parent and the playouts of the child. We test the resulting algorithms on several games. For all games, original configurations of our algorithms improve on state of the art algorithms.
منابع مشابه
Volume Requirements of 3D Upward Drawings
This paper studies the problem of drawing directed acyclic graphs in three dimensions in the straight-line grid model, and so that all directed edges are oriented in a common (upward) direction. We show that there exists a family of outerplanar directed acyclic graphs whose volume requirement is super-linear. We also prove that for the special case of rooted trees a linear volume upper bound is...
متن کاملAn O(n) Approximation and Integrality Gap for Disjoint Paths and Unsplittable Flow in Undirected Graphs and DAGs
We consider the maximization version of the edge-disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain an O( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural relaxation based on multicommodity flows. Our upper bound matches within a consta...
متن کاملn) approximation and integrality gap for EDP and UFP in undirected graphs and DAGs
We consider the maximization version of the edge disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain anO( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural multicommodity flow based relaxation. Our upper bound matches to within a constant...
متن کاملAn O(sqrt(n)) Approximation and Integrality Gap for Disjoint Paths and Unsplittable Flow
We consider the maximization version of the edge-disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain an O( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural relaxation based on multicommodity flows. Our upper bound matches within a consta...
متن کاملGame chromatic number of graphs
y Abstract We show that if a graph has acyclic chromatic number k, then its game chromatic number is at most k(k + 1). By applying the known upper bounds for the acyclic chromatic numbers of various classes of graphs, we obtain upper bounds for the game chromatic number of these classes of graphs. In particular, since a planar graph has acyclic chromatic number at most 5, we conclude that the g...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011